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Tuning one's shower in some hotels may turn into a challenging co- 
ordination game with imperfect information. The temperature sensitivity 
increases with the number of agents, making the problem possibly unlearn- 
able. Because there is in practice a finite number of possible tap positions, 
identical agents are unlikely to reach even approximately their favorite wa- 
ter temperature. Heterogeneity allows some agents to reach much better 
temperatures, at the cost of higher risk. 

JEL Classification Number: C02, C61, C62, C63, D70, D83 

Keywords: coordination, heterogeneity, adaptive learning, non-linear system, feed- 
back 

1 Introduction 

Taking a shower can turn into a painful tuning and retuning game when many 
people take a shower at the same time if the flux of hot water is insufficient. 
In this fascinating game, it is in the interest of everybody not only to reach an 
agreeable equilibrium temperature but also to avoid large fluctuations. These 
two goals are difficult to achieve because one inevitably not only has incomplete 
information about the behavior and personal preferences of the other bathers, 
but also about the non-linear intricacies of the plumbing system. 
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2 The Shower Temperature Problem 



The central issue of this paper is to find the conditions under which the agents 
are satisfied, which depends on the learning procedure and on its parameters. 

The need to depart from rational representative agents was forcefully voiced 
among others by Kirman (2006) and Brian Arthur, for instance in his El Farol 
bar problem (Arthur 1994), subsequently simplified as Minority Game (Challet 
and Zhang 1997, Challet, Marsili and Zhang 2005), from which we shall borrow 
some ideas concerning the learning mechanism. In these models, the agents try to 
behave maximally differently from each other, hence the need for heterogeneous 
agents. 

The Shower Temperature Problem is different in that the perfect equilibrium is 
obtained when all the agents behave exactly in the same optimal, unique way. A 
priori, it is a perfect example of a case where the representative agent approach 
applies fully. As we shall see, however, because in practice there is a maximum 
number of tap tuning settings, it may pay off to be heterogeneous with respect 
to the strategy sets. Therefore, the problem we propose in this paper is another 
example of a situation where heterogeneity is tempting because potentially bene- 
ficial. The intrinsic and strong non-linearity of the temperature response function 
prevents the use of the mathematical machinery for heterogeneous systems that 
successfully solved the Minority Game (Challet et al. 2005, Coolen 2005), the El 
Farol bar problem (Challet, Ottino and Marsili 2004) and the Clubbing problem 
(De Sanctis and Galla 2006). 

2 The Shower Temperature Problem 

One of the problems of poor plumbing systems is the interaction between the 
water temperatures of all the people taking a shower simultaneously. If one person 
changes her shower setting, she infiuences the temperature of all the other bathers. 
Cascading shower tuning and retuning may follow. A key issue is how people can 
learn from past temperature fiuctuations how to tune their own shower so as to 
obtain an average agreeable temperature T, and also to avoid large temperature 
fiuctuations. 

Some rudimentary shower systems allow only for one degree of freedom, the de- 
sired fraction of hot water in one's shower water, denoted by </> G [0, 1]. Assuming 
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that H and C denote the maximal fluxes of hot and cold water available to a 
shower, and that the total flux at this shower is constant, the obtained tempera- 
ture is equal to 

rpHTn + CTcjl - <P) 

where T^ and Tq denote the constant temperatures of hot and cold water. 

In the following, we shall consider the special case were H = C, Tc = 0, and 
Th = 1, which amounts to express T in Th units, i.e. to rescale T by Th, which 
leads to T = 0. 

The situation may become more complex however if many people take a shower 
at the same time. Indeed, it sometimes happens that altogether the bathers 
ask for a larger hot water flux than the plumbing system can provide, a feature 
more likely found in old-style youth hostels than in more upmarket hotels (hence 
the title). Assume that the total available hot water flux for all bathers together 
is H while the cold water flux available at each single shower is C = if. We 
denote by $ = YliLi 'Pi the total fraction of asked hot water. If $ > 1, each agent 
will only receive instead of 0j and the total flux of hot water she obtains is 
smaller than expected.^ Finally, agent i obtains 

where ^ = max(l, $). Clearly, Ti(0i = 0) = and Ti(</)i = 1) = 1. When $ < 1, 
this equation reduces to the no-interaction case Tj = 0j. Therefore, provided that 
$ > 1, the agents interact through the temperature they each obtain, that is, via 
$. Assuming no inter-agent communication, the global quantity $ is the only 
means of interaction. Therefore, this model is of mean-field nature. Henceforth, 
we consider the more involved case of interaction, i. e. $ > 1. 



The fraction of cold water in this case is still 1 — 0^, according to the agent's choice, since 
cold water is assumed to be unrestricted. 
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Figure 1: Individual temperature as a function of in the homogeneous case for 
increasing (from top to bottom). 

3 Tuning one's shower 



3.1 Equilibrium and sensitivity: the homogeneous case 

Before setting up the full adaptive agent model, we shall discuss the homogeneous 
case where = (f). 

Assuming that all the agents have the same favorite temperature (Tj = T < 1), 
they do not interact if < 1/T, in which case = T. IfA^> 1/T the equilibrium 
is reached when 

1/1 \ 

(3) 



1 I 



Hence, there is always a (p that satisfies everybody (for instance, setting T = 1/2 



leads to 



1 — 1/A^). In equilibrium each agent actually gets (peqH/ {N ■ 



C/N hot water instead of (peqH and thus a total water flux of C/N+{1 —(f)eq)C = 
C/{NT). Hence, indeed the desired temperature T is reached for every agent, 
but the total water flux per agent is quite small for large A^. 

The sensitivity of T to 0, defined as x = ^ = [i_|_jv(i_0)]2 is an increasing function 
of (f) and maximal at </> = 1 (a similar result also holds for Tj = ^__^_^([_^.-^ )- The 
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problem is that x(0eq) = NT"^ oc A^; therefore, as increases, tuning around 
0cq becomes more and more difficult, suggesting already that the agents might 
experience difficulties to learn how to tune their shower. Figure [1] illustrates this 
phenomenon: as increases, the region in which most of the variation of T 
occurs shrinks substantially. 

This problem is made worse by the fact that, in practice, there is only a finite 
number S'max of (ps that can be effectively used by the agents, mostly because 
of internal tap static friction — the larger the friction, the smaller the number of 
different achievable (j)s. Assuming that the resolution in is 6(j), or equivalently 
that 5 = 1/ {6(j)) values of (p are usable, it becomes impossible to tune one's shower 
if \T{(j)cq ± S(f)) — T| ~ x(0cq)(5</) is larger than some acceptable value. As x oc A^ 
around ^eq, 5* oc A^ is needed; as a consequence, the ideal temperature is not 
learnable beyond a number of agents, which is for a large part pre-determined by 
the plumbing system. 

3.2 Learning 

The question is how to reach 0cq- In this model, it is hoped that the agents 
have a common interest to avoid large fluctuations of Tj around their favorite 
temperature Tf. the Shower Temperature Problem is a repeated coordination 
game (cf. ? and ?) with many agents and limited information. 

The dynamics of the agents are fully determined by their possible tap settings, 
thereafter called strategies, and by the trust they have in them. Each agent i 
has S possible strategies (pi^s with s = 1,...,S chosen in [0,1] before the game 
begins and kept constant afterwards (how to choose the 0s is discussed in the 
next section). The typical resolution in is l/S*; for the same reason, the typical 
maximal 0j over all the agents is of order 1 — 1/5. This paper follows the road of 
inductive behavior advocated by Brian Arthur: to each possible choice (pi^s agent 
i attributes a score U.i^s{t) (where t denotes the time step of the game), which 
describes its cumulated payoff at time t. The agents choose probabilistically their 
(pi^s according to a logit model P{(pi{t) = (pi^s) = exp(rf/j_5(t))/Z, where Z is a 
normalization factor and F is the rate of reaction to a relative change of Ui^s- 

If one were to follow blindly El Farol bar problem and Minority Game literature. 
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one would write 



+ 1) = Ui,s{t) + Ti - T,(t) 



When S > 2, such payoffs are not suitable any more, as the agents switch be- 
tween their highest and smallest the intermediate ones being sometimes used 
only because of fluctuations induced by the stochastic strategy choice. A payoff 
allowing for a gradual increase of (pi^s is necessary. Absolute value-based payoffs 
are fit for this purpose^: mathematically, 



This payoff however does not depend on (j)i^s- As a consequence, all the strategies 
have the same payoff. Therefore, one has to give more information to the agents. 
An agent that has perfect information about the plumbing system, the temper- 
atures and fluxes of hot and cold water — for instance the plumber that built 
the whole installation — may know precisely which temperature she would have 
obtained, had she played (pi^s' instead of her chosen action <t>i^Si{t)- Such people 
are probably not very frequent amongst the general population, however. This is 
why we shall consider an in-between case, where the agents' estimation of Ti s{t) 
is a linear interpolation between the temperature of the strategy currently in use, 
i. e. Ti{t) = Ti^si{t) and its correct virtual value. The payoff is therefore 



where i] G [0, 1] encodes the ability of the agents to infer the influence of 
on the real temperature and < A < 1 introduces an exponential decay of 
cumulated payoffs, with typical score memory length oc 1/A. The parameter rj 
is related to the difference between naive and sophisticated agents as defined by 
(Rustichini 1999). The first kind of agents believe that they are faced with an 
external process, i.e. that they do not contribute to $, whereas sophisticated 
agents are able to compute $_j = $ — 0^. In this model, perfect sophisticated 
agents have 77 = 1. 



^Quadratic payoffs, albeit mathematically sound, are more problematic for performing numer- 
ical simulations. 



uut + i) = uut) 



Ui^sit + 1) = - A) - A T, - (1 - r,)Ti{t) - r,T,^s{t) , 



(4) 
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4 Results 

It is natural to measure two collective quantities, the average temperature T ob- 
tained by the agents and its average distance from ideal temperature averaged 
over all the agents, denoted by AT = T — T; this characterizes the average tem- 
perature obtained by the agents, or how far the agents are collectively from their 
goal. The individual dissatisfaction is the distance from the ideal temperature 
for a given agent; one therefore measures it with \5T\ = J2iLi \Ti — Tj|; it is a 
measure of the average risk. 

All the quantities reported here are measured in the stationary state over 10, 000 
time steps for T = 0.5, ?7 = 1, A = 0.001 and if not stated differently = 20, 
after an equilibration time of 30/ (AF). The stationary state does not depend 
much on A. On the other hand, the performance of the population is of course 
improved as r] increases and saturates for r] > 0.5. The role of T is discussed 
below. 

4.1 Homogeneous population 

Since the equilibrium is reached when all the agents tune their shower in exactly 
the same way, trying first homogenous agents (or equivalently a representative 
agent) makes sense a priori. We shall therefore set ipi^s = <Ps = s = 1, S 
so that the agents avoid using only hot or cold water. 

Agents with homogeneous strategies have a peculiar way of converging to their 
ideal temperature as S increases. Figure [2] displays the oscillations of the reached 
temperature with decreasing amplitude as a function of S. The asymmetric 
upward and downward slopes are due to the asymmetry of T around (p^q, as seen 
in Figure [H Theoretically, this can easily be explained by assuming that all the 
agents select the same s that gives T as close as possible to T. If s was a real 
number, s = [1 — 1/A^(1/T — 1)](5' + 1). The choice of the agents therefore is 
limited to [s] and [s] + 1 where [x] is the integer part of x (one may need to 
enforce [s] < S when S < N). T{[s]) and T{[s] + 1) are alternatively closest to 
T, therefore this actual optimal temperature Tth (whichever T([s]) or T([s] + 1)) 
oscillates around T, as seen in Figure [2l The period of the oscillations is A^, and 
their amplitude decreases as 1/5*. As expected, a very large value of F replicates 
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Figure 2: Temperature T reached by homogeneous agents as a function of 5* for 
various F. Inset: T vs. [S + 1)/N, showing the scahng property of T, 
with = 10,20,40 (asterisks, triangles, crosses). 

closely the dented nature of the value of Tth, in which case all the agents take 
the same choice even close to the peak of Tth. Generally, smaller Fs (at least 
to a certain degree) lead to better average temperatures as it allows to play 
mixed strategies, and thus combine two temperature so as to achieve a collective 
average result closest to T. From that point of view. F = 50 is a better choice 
than F = 1000. Hence, there exists an optimal global value of F, leading to a 
mixed-strategy equilibrium. This is because taking stochastic decisions is a way 
to overcome the rigid structure imposed on the strategy space, whose inadequacy 
is reinforced by the strong non-linearity of T(0). A too small F is detrimental as 
it allows for using (p further away from (p^q, because of the shape of T{(j)), those 
with smaller are more likely to be selected. 

The individual dissatisfaction \6T\ unsurprisingly mirrors | AT| since all the play- 
ers are identical. Both quantities are the same for large F as everybody plays 
the same fixed strategy. \6T\ also decreases as 1/S (see Figure [5]). However, the 
larger F, the smaller \ST\, as each agent manages to get closer to the optimal 
choice. 
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Figure 3: Temperature T reached by homogeneous agents as a function of S for 
r = 100. Squares: theory, circles: numerical simulations. Inset: aver- 
age deviation I from T versus F (same parameters); the dotted lines 
are for eye guidance only. 

It is easy to obtain analytical insights by solving the stationary state equations 
for Ui^s- For the sake of simplicity, assuming that 77 = 1 and that only the two 0s 
surrounding 0eq, i- e. [s] and [■§] + !, denoted by — and + respectively, are used, 
one obtains the set of equations (independent from A and i) 

Ui,± = U± = -\T±-f\ (5) 

where ^ 

with iV± = AT . P(. = ±), where P{s = +) = ^^.^^Zf+^^ru.,-) = = 

1 — P(s = +) is a Logit model for the two-strategy case S = 2. Figure [3] shows the 
good agreement between numerical simulations and this simple theory, especially 
in the convex part of the oscillations, as long as F is large enough (about 50) to 
prevent the use of more than 2 strategies. 

Being faced with oscillations is problematic since the agents do not know a 
priori and because may vary with time. In addition, since all the agents select 
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Figure 4: Absolute temperature deviation |AT| reached by homogeneous 
(squares) and heterogeneous (circles) agents as a function of S for 
r = 100. Average over 500 samples for heterogeneous agents. 

the same </> for large F, not a single agent is ever likely to reach a temperature 
close to T. The agents do not know whether on average they will overheat 
or chill. A way to measure this uncertainty is to measure the average |AT| 
over S in numerical simulations, for instance with / = YI^s^n |AT|/(4A^).^ The 
inset of Figure [3] reports that the minimum of / is at F ^ 42 for the chosen 
parameters, which shows the existence of an optimal learning rate. Since the 
individual satisfaction is maximal in the limit F — > oo (see above) there is no 
minimum of a similar measure for \6T\. 



4.2 Heterogeneous populations 

There are many ways for agents to be heterogeneous. One could imagine to vary 
S, F, rj, X OT T amongst the agents. Here we focus on strategy heterogeneity, 
i. e. the agents face showers with different tap settings: the strategy space of agent 

^Simulations show that the average temperature is in fact a function of (5 + 1)/N (cf. Figure 
[2]) (instead of a fimction of S and N), i. e. Figure [3] would look the same if S was fixed and 
N varied. Hence we may take the average over S instead of over N. 
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Figure 5: Individual dissatisfaction \5T\ reached by homogeneous (empty squares) 
and heterogeneous agents (full circles) as a function of S* for F = 1000. 
Average over 500 samples for heterogeneous agents. Dashed line: the- 
oretical predictions. 



i is no longer . . . , but now each agent has an individual strategy space 
where each strategy (pi^s, s = 1, . . . , S, is assigned a random number from the 
uniform distribution on [0, 1] before the simulation. 

Intuitively, the effect of heterogeneity is to break the structural rigidity of the 
strategy set of a representative agent. Figure H] reports that |AT| does not os- 
cillate, but converge (from below) faster than to zero. Homogeneous agents 
might achieve a better average temperature depending on and S, but on the 
whole clearly perform collectively worse. In addition, heterogeneous agents ex- 
pect to have a smaller than ideal temperature, but on average predictably smaller, 
with no strong dependence on S. Thus, the expectation over the temperature of 
the agents is much improved by heterogeneity. 

However, looking at the average absolute individual deviation from T reveals 
that the uncertainty brought by heterogeneity is considerably worse on average. 
Plotting \6T\ for both types of agents shows that \6T\ is always smaller for homo- 
geneous agents (Figure [5]). This means that if being heterogeneous is more risky. 
Which agent (or equivalently, shower) performs better depends not only on A^, 
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Figure 6: Fraction of the runs for which a single heterogeneous agent is worse 
off than the other — 1 homogeneous agents; F = 1000 (crosses) and 
F = 30 (circles). Average over 2000 samples. 

but also on the tuning settings of all the agents. 



5 Discussion and conclusions 

Heterogeneity may be tempting as it suppresses the systematic abrupt oscillations 
experienced by homogeneous populations and is collectively better on average. 
However, it seems that heterogeneous showers are potentially more risky. In 
other words, the agents must consider the trade-off between the temptation of an 
expected better temperature and a potentially larger deviation. 

The situation discussed above is only global. Does it pay to be heterogeneous for 
a single agent? An answer comes from a system consisting of A^ — 1 homogeneous 
agents as defined above and a single random one with random 0j,sS. The fraction 
/ of the runs at fixed S that give a better 6Ti to the homogeneous showers is 
reported in Figure [6l this quantity indicates that the majority of heterogeneous 
agents are not worse off for about a quarter of the values of S. This finding 
is not in contradiction with the fact that the average personal dissatisfaction of 
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heterogeneous agents is always larger than that of homogeneous agents: \6T\ is 
much influenced by large deviations contributed by a minority of agents because 
of large temperature sensitivity to small deviations in </>. Finally, the advantage 
of the homogeneous population increases with F, as a large learning rate helps 
only using one's best strategy. 

As a final note, minimizing |AT| is equivalent to solving a number partitioning 
problem (Garey and Johnson 1979) in which one splits a set of numbers Oj > 
into two subsets, so that the sums of the numbers in the subsets are as close as 
possible, which amounts to minimize C = \ "^^Siail where = ±1; it is an NP- 
complete problem; in other words, the only way to find the absolute minimum of 
C is to sample all the 2^ configurations. Let us consider an even simpler version 
of the Shower Temperature Problem that makes more explicit its NP-complete 
nature. Each agent i is given and plays 0eq + Sitti, Sj = ±1. Neglecting the 
self-impact on the resulting temperature and the non-linearity of the temperature 
response, the analogy between the Shower Temperature Problem and the num- 
ber partitioning problem is straightforward. Methods borrowed from statistical 
mechanics show that the average optimal C scales as , which requires to enu- 
merate the 2^ possible configurations (Mertens 1998). This is much better than 
what the agents achieve; the reason for this discrepancy is that the agents do 
not reach a stationary state in O(expA^) time steps, hence, they cannot sample 
all the possible configurations. Another reason is that the optimal solution may 
require some agents to use a strategy that would yield a worse temperature than 
their optimal choice. 

In conclusion, the Shower Temperature Problem shows the subtle trade-offs be- 
tween a homogeneous population with equally spaced actions and a fully random 
one. In a system where the agents' action space is not likely to include the opti- 
mal equilibrium choice, heterogeneity is a way to solve more robustly, with less 
systematic deviation this kind of problem, at the expense of a higher risk for 
individual agents. 
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